Compiling Trace And Unification Grammar For Parsing And Generation
Abstract
This paper presents Trace & Unification Grammar (TUG), a declarative and reversible grammar formalism that brings together Unification Grammar (UG) and ideas from Government & Binding Theory (GB) in an undogmatic way. A grammar compiler is presented that transforms a grammar written in the TUG formalism into two different forms, one suited to parsing, the other to generation.
1 INTRODUCTION
In recent years there has been growing interest in NL systems that can be used for both parsing and generation. The invention of unification grammar, which allows for a declarative description of language, made it possible to use the same grammar for both tasks. The main goal of a grammar is then to describe a relation between normalized (semantic) representations and language strings. A grammar that can be used in both directions is called "reversible". We can distinguish three levels of reversibility.
On the first level, not only is the same grammar used for parsing and generation, but the interpreter itself is also reversible. This approach is taken in Shieber (1988). Besides its elegance, the approach has the advantage that reversibility is guaranteed. Further advantages are mentioned in Neumann (1991). As a disadvantage, it is as yet unclear whether and how these systems can be made efficient.
On the second level we find systems in which the same reversible grammar is processed by two different interpreters, one for parsing and one for generation. The advantage of these systems is that the grammar can be changed and tested easily, which helps to shorten the development cycle. The disadvantage, again, is that grammar interpreters are usually too slow to be used in realistic systems.
On the third level, finally, we find systems in which the linguistic description is given in a reversible declarative grammar. This grammar is then compiled into two different forms, one useful only for parsing, the other only for generation. The disadvantage here is that compiling can take some time and therefore prolongs the development cycle; the advantage lies in the efficient processing that can be achieved with compiled grammars. Strzalkowski (1990) and Strzalkowski/Peng (1990) describe a compiler that transforms a grammar originally written for parsing into an efficient generator.
In the following section I will present a system of the third type and show which compiling methods can transform a grammar written in a perspicuous formalism, TRACE AND UNIFICATION GRAMMAR (TUG), into fast parsers and generators. The proposed compilers and their modular architecture have the further advantage that most of their parts can also be used for formalisms other than the one described, e.g. DCGs.
The whole system is part of a polyfunctional linguistic processor for German called LINGUISTIC KERNEL PROCESSOR (LKP). The LKP contains a grammar of German with broad coverage. The grammar describes the relation between a subset of German and a subset of QLF, the intermediate semantic form used in the Core Language Engine of SRI Cambridge (Alshawi 1990). The LKP has been implemented in PROLOG. Parsing and generation of a sentence of up to 15 words normally takes between 1 and 10 seconds, with a strong tendency towards the lower bound.
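The third level of reversibility can be illustrated with a deliberately tiny sketch. It is not the TUG formalism or the LKP compiler, and it is written in Python rather than PROLOG; the lexicon, the single sentence rule, and all function names (`compile_for_parsing`, `compile_for_generation`, `parse`, `generate`) are hypothetical. It only shows the general idea that one declarative description can be compiled into two direction-specific forms.

```python
# Toy illustration only: NOT the TUG formalism or the LKP compiler.
# One declarative description relates semantic predicates to words;
# two "compiled" tables are derived from it, one per direction.

# Declarative lexicon (hypothetical): category -> [(predicate, word)].
LEXICON = {
    "np": [("peter", "Peter"), ("mary", "Mary")],
    "v": [("sees", "sees"), ("likes", "likes")],
}


def compile_for_parsing(lexicon):
    """Index by surface word: word -> (category, predicate)."""
    return {
        word: (cat, sem)
        for cat, entries in lexicon.items()
        for sem, word in entries
    }


def compile_for_generation(lexicon):
    """Index by meaning: (category, predicate) -> word."""
    return {
        (cat, sem): word
        for cat, entries in lexicon.items()
        for sem, word in entries
    }


PARSE_TABLE = compile_for_parsing(LEXICON)
GEN_TABLE = compile_for_generation(LEXICON)


def parse(sentence):
    """String -> (pred, subj, obj), for the single rule s -> np v np."""
    words = sentence.split()
    if len(words) != 3:
        return None
    entries = [PARSE_TABLE.get(w) for w in words]
    if None in entries:
        return None
    (c1, subj), (c2, pred), (c3, obj) = entries
    if (c1, c2, c3) != ("np", "v", "np"):
        return None
    return (pred, subj, obj)


def generate(sem):
    """(pred, subj, obj) -> string, using the same rule in reverse."""
    pred, subj, obj = sem
    try:
        return " ".join(
            [GEN_TABLE[("np", subj)], GEN_TABLE[("v", pred)], GEN_TABLE[("np", obj)]]
        )
    except KeyError:
        return None
```

Both tables are derived from the same `LEXICON`, so a change to the description affects parsing and generation alike, at the cost of recompiling. This mirrors the trade-off named for the third level: a longer development cycle in exchange for fast direction-specific lookup.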
Similar resources
An experimental parser for systemic grammars
We describe a general parsing method for systemic grammars. Systemic grammars contain a paradigmatic analysis of language in addition to structural information, so a parser must assign a set of grammatical features and functions to each constituent in addition to producing a constituent structure. Our method constructs a parser by compiling systemic grammars into the notation of Functiona...
Some Remarks on the Decidability of the Generation Problem in LFG- and PATR-Style Unification Grammars
In this paper, we prove the decidability of the generation problem for those unification grammars which are based on context-free phrase structure rule skeletons, e.g. LFG and PATR-II. The result shows a perhaps unexpected asymmetry, since it is valid also for those unification grammars whose parsing problem is undecidable, e.g. grammars which do not satisfy the off-line parsability constra...
On the Universal Generation Problem for Unification Grammars
The universal generation problem for unification grammars is the problem of determining whether a given grammar derives any terminal string with a given feature structure. It is known that the problem is decidable for LFG and PATR grammars if only acyclic feature structures are taken into consideration. In this brief note, we show that the problem is undecidable for cyclic structures. This hold...
An Algorithm For Generation In Unification Categorial Grammar
We present an algorithm for the generation of sentences from the semantic representations of Unification Categorial Grammar. We discuss a variant of Shieber's semantic monotonicity requirement and its utility in our algorithm. We indicate how the algorithm may be extended to other grammars obeying the same requirement. Appendices contain a full listing of the program and a trace of execution of...
Parsing Korean based on Dependency Grammar and GULP
This paper presents a parsing algorithm in Prolog using GULP, based on dependency grammar and unification-based grammar. It parses declarative sentences of a free-word-order language, Korean. The dependency grammar accepts free order of the words in a sentence. Unification-based features separate the grammar from the parsing algorithm and also simplify the notation of the grammar. GULP (Graph ...
Publication date: 1991